A Comparison of DFT, PLP and Cochleagram for Alphabet Recognition

نویسنده

  • Mark Fanty
چکیده

The English alphabet is a small but dificult vocabulary fo r speech recognition, with many fine phonetic distinctions, such as M / N and B/V. We use speakerindependent classification of isolated English letters t o evaluate the relative performance of the D F T , Perceptual Linear Predictive analysis, and the cochleagram auditory model. Feedforward neural network classifiers were trained using all three representations o n 60 speakers and tested on 60 new speakers. Training and testing data were independently modified b y adding two levels of Gaussian noise and babble (20 random letter utterances, attenuated and given random offsets). PLP gave the best results, especially when trained or tested on Gaussian noise.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker-independent Vowel Recognition: Spectrograms versus Cochleagrams

We examined the ability of multi-layer perceptrons (MLPs) trained with backpropagation to classify vowels excised from natural continuous speech. Two spectral representations were compared: spectrograms and cochleagrams. The features used to train the MLPs included DFT or cochleagram coefficients from a single frame in the middle of the vowel, or coefficients from each third of the vowel. We al...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Comparison of Speech Features on the Speech Recognition Task

In the present work we overview some recently proposed discrete Fourier transform (DFT)and discrete wavelet packet transform (DWPT)-based speech parameterization methods and evaluate their performance on the speech recognition task. Specifically, in order to assess the practical value of these less studied speech parameterization methods, we evaluate them in a common experimental setup and comp...

متن کامل

DFT Study and Comparison between B6C4Si and C16 Clusters as a Vitamin C Carrier

In this study the chemical properties of B6C4Si and C16 Clusters connected vitamin C have been investigated using density functional theory (DFT). NMR parameters and HOMO- LUMO Gap energy are calculated by using density functional method (B3LYP) with 6-311G* basis set. Calculations show that HOMO- LUMO Gap energy of vitamin C decreases after connecting to B6C4Si or C16 cluster decreasing of HOM...

متن کامل

Robustness of Phoneme Classification Using Support Vector Machines: a Comparison between Plp and Acoustic Waveform Representations

Robustness of phoneme recognition to additive noise is investigated for PLP and acoustic waveform representations of speech using support vector machines (SVMs) combined via error-correcting code methods. While recognition in the PLP domain attains superb accuracy on clean data, it is significantly affected by mismatch between training and testing noise levels. The classification in the high-di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004